CDS

Accession Number TCMCG019C13424
gbkey CDS
Protein Id XP_022942730.1
Location complement(join(5518825..5519184,5519492..5519616,5519699..5519777,5519865..5520038,5520262..5520342,5520440..5520520,5520685..5520750,5520832..5520897,5523603..5523653,5523770..5523862,5525099..5525226,5525339..5525477,5526246..5526343,5526431..5526500,5526652..5526747))
Gene LOC111447680
GeneID 111447680
Organism Cucurbita moschata
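
Note on the Location field: the value follows the INSDC feature-location syntax, where `join(...)` lists the exon segments (1-based, inclusive) and `complement(...)` places the feature on the reverse strand. As a minimal sketch, the segments can be parsed and summed to check that the spliced length agrees with the 568 aa protein plus one stop codon. The function names `parse_location` and `spliced_length` are hypothetical helpers introduced here for illustration and are not part of any database tool.

```python
import re

# Location string copied from the Location field of this record.
LOCATION = (
    "complement(join(5518825..5519184,5519492..5519616,5519699..5519777,"
    "5519865..5520038,5520262..5520342,5520440..5520520,5520685..5520750,"
    "5520832..5520897,5523603..5523653,5523770..5523862,5525099..5525226,"
    "5525339..5525477,5526246..5526343,5526431..5526500,5526652..5526747))"
)


def parse_location(location):
    """Return (strand, segments); segments are 1-based inclusive (start, end) pairs."""
    strand = "-" if location.startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments


def spliced_length(segments):
    """Total number of nucleotides covered by all segments."""
    return sum(end - start + 1 for start, end in segments)


strand, segments = parse_location(LOCATION)
total = spliced_length(segments)
print(strand, len(segments), total)  # - 15 1707
```

The total of 1707 nt equals 3 x (568 + 1), which is consistent with the protein length stated in this record plus a terminal stop codon.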

Protein

Length 568aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA418582
db_source XM_023086962.1
Definition putative clathrin assembly protein At2g01600 [Cucurbita moschata]

EGGNOG-MAPPER Annotation

COG_category TU
Description clathrin assembly protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000
ko04131
KEGG_ko ko:K20043
EC -
KEGG_Pathway -
GOs GO:0005575
GO:0005623
GO:0005886
GO:0016020
GO:0044464
GO:0071944

Sequence

CDS:  
ATGAATATGGCTACGCTTCAGACATGGAGGAAAGCTTATGGCGCTCTCAAGGATTCTACCAAAGTCGGCCTTGCCCATGTCAATAGCGATTATGCGGATTTGGATGTGGCAATAGTCAAAGCTACTAACCACGTCGAGTGCCCGCCGAAAGAGAGACACCTCAGGAAAATTTTGGTTGCTACATCGGCAATCAGGCCCCGTGCTGATGTTGCTTATTGCATTCATGCCCTTGCCCGACGATTGTCCAAGACGCGTAATTGGACGGTAGCTTTGAAAGCGTTAATAGTCATACATAGGACCTTGAGGGAGGGCGACCCAACATTCAGGGAAGAACTTTTGAATTTTACACAAAGAGCTAAAATCCTTCAATTATCGAATTTTAAGGATGATTCAAGTCCTATTGCTTGGGATTGCTCTGCATGGGTACGTACATATGCATTGTTTTTAGAGGAGCGACTCGAATGTTTCAGGATACTGAAGTATGACATTGAATCTGAACGCCTGCCAAGACCTGCCCAAGGTCAGGATAAGGGCTACAGCAGAACCAGGGAACTGGACAGTGAAGAACTGTTGGAACATTTGCCTGCTTTGCAACAGCTGTTGTATCGTCTTATTGGCTGCAAGCCGGAAGGAGCAGCTATTGGGAATTATGTTATACAGTATGCCTTGGCACTGGTATTGAAAGAGAGCTTTAAAATCTATTGTGCTATTAATGATGGAATTATAAATCTTGTTGACAAGTTTTTTGAGATGCCAAGGCACGAGGCTATCAAAGCCCTTGATATCTATAAAAGAGCTGGCCAACAGGCAGGAAGCCTATCAGATTTCTATGATATTTGCAAAGGGTTAGAACTTGCTCGGAATTTCCAGTTTCCTGTTTTAAGAGAGCCCCCACAGTCATTTCTTAATACGATGGAAGAGTATATTAGGGAGGCACCACGAGTTGTTACAGTACCAAATGAACCACTGCTGCAACTTACTTACAAGCCGGAAGACTCTCCTTCAGAAGATCCGAACTTACCAACAGATGAACCAGAGGCTTCTCCTTCAGATGATCTCTCTATTACTCCTGTTGAGACGGTTCCAGCACCACCTCCAGCACCAGCTCCTGCTCCAACCCATTCAGAGACTGGAGATTTATTGGGATTGAGTCTTGCTACCACAGAAGTATCCGCCATTGAGGAGAGAAATGCTTTGGCTTTAGCTATAGTTCCTTCTGGTGATGCAGCAGCGTCCACTTTTCATTCCAATGGTGTGCAGGCAAAGGATTTCGATCCTACTGGTTGGGAGCTAGCCTTGGTCACCACTCCAAGTGGTAACCTTTCATCAGCTAATGAGAGACAACTGGCTGGTGGGTTGGACACGCTCACTCTCGATAGTTTATATGATGAAGGTGCATATAGAGCTTCTCTACAGCCAGTGTACGGTAAGCCTGCACCAAACCCATTTGAAGTGCAAGATCCGTTTGCATATTCAAATGCCGTCGCTCCACCTCCATCGGTTCAAATGGCGCCACTAGCTCAGCAGCAGGCGGCGAATCCTTTTGGTCCTTACCAGCCCACCTTCCCACAGCAGCAACAACAACCATTTACAATGGACCCAACAAATCCTTTTGGTGATGCAGGTTTCAGTGCATTCCCTGCTCCTAACCACCATACCGTACCTCCACCCACAAGCAATCCATTCGGAAGCACAGGCCTGCTGTAG
Protein:  
MNMATLQTWRKAYGALKDSTKVGLAHVNSDYADLDVAIVKATNHVECPPKERHLRKILVATSAIRPRADVAYCIHALARRLSKTRNWTVALKALIVIHRTLREGDPTFREELLNFTQRAKILQLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQDKGYSRTRELDSEELLEHLPALQQLLYRLIGCKPEGAAIGNYVIQYALALVLKESFKIYCAINDGIINLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFLNTMEEYIREAPRVVTVPNEPLLQLTYKPEDSPSEDPNLPTDEPEASPSDDLSITPVETVPAPPPAPAPAPTHSETGDLLGLSLATTEVSAIEERNALALAIVPSGDAAASTFHSNGVQAKDFDPTGWELALVTTPSGNLSSANERQLAGGLDTLTLDSLYDEGAYRASLQPVYGKPAPNPFEVQDPFAYSNAVAPPPSVQMAPLAQQQAANPFGPYQPTFPQQQQQPFTMDPTNPFGDAGFSAFPAPNHHTVPPPTSNPFGSTGLL
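
Note on the CDS and Protein fields: the protein sequence can be cross-checked against the CDS by conceptual translation. The sketch below assumes the standard genetic code (NCBI translation table 1), which is the usual choice for a nuclear plant gene but is not stated in this record. The helper `translate` is hypothetical, and only the first 15 and last 9 nucleotides of the CDS above are used as inputs.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases cycling in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = translate("ATGAATATGGCTACG")  # first 15 nt of the CDS above
tail = translate("CTGCTGTAG")        # last 9 nt of the CDS above
print(head, tail)  # MNMAT LL
```

Both fragments agree with the Protein field, which begins with MNMAT and ends with LL before the TAG stop codon. Passing the full CDS string to `translate` would extend the same check to the whole sequence.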